Abstract — The yield of low voltage digital circuits is found to
be sensitive to local gate delay variations due to uncorrelated
intra-die parameter deviations. Caused by statistical deviations
of the doping concentration, they lead to more pronounced delay
variations for minimum transistor sizes. Their influence on path
delays in digital circuits is verified using a carry select adder
test circuit fabricated in 0.5 and 0.35 μm complementary
metal-oxide-semiconductor (CMOS) technologies with two different
threshold voltages. The increase of the path delay variations for
smaller device dimensions and reduced supply voltages as well as
the dependence on the path length is shown. It is found that
circuits with a large number of critical paths and with a low
logic depth are most sensitive to uncorrelated gate delay
variations. Scenarios for future technologies show the increased
impact of uncorrelated delay variations on digital design. A
reduction of the maximal clock frequency of 10% is found for,
e.g., highly pipelined systems realized in a 0.18-μm CMOS
technology.

Index Terms — Gate delay variations, low-voltage digital design,
path delay variations, yield, parameter variations, pipelined
circuits, scaling, SRAM, Vth variations.



I. Introduction 

FOR an increasing range of applications, low power dissipation
is an integrated circuit (IC) feature as important as area and
speed. In the case of portable systems, low power circuits allow
longer periods of operation. For IC's embedded in high
performance systems, a low power dissipation reduces the expenses
for chip cooling or increases the reliability due to the lower
chip temperature.

The main contribution to the power dissipation is the dynamic
power for charging the capacitive loads. According to

$$P_{dyn} \propto C_{load} \cdot V_{DD}^{2} \qquad (1)$$

in the case of static CMOS circuits, where VDD is the supply
voltage and Cload is the load capacitance (including the
capacitances for the wiring, the junctions, and the transistor
gates), the power dissipation can be significantly reduced by
scaling the supply voltage. Additionally, power can be saved by
reducing the capacitive load Cload, e.g., by using small device
sizes.

The reduction of the supply voltage has two disadvantages.
First, the gate delays are increased. To first order, the gate
delay d_gate of an inverter can be described by

$$d_{gate} \propto \frac{C_{load}\, V_{DD}}{\mu\, C_{ox}\, (W/L)\, (V_{DD} - V_{th})^{\alpha}} \qquad (2)$$

where μ is the mobility, Vth is the threshold voltage, Cox is
the gate capacitance per unit area, W and L are the transistor
dimensions, and α has a value between two and one, depending on
the short channel effect [1]. Second, a reduced supply voltage
leads to an increased sensitivity S of gate delays to parameter
variations influencing Vth according to

$$S = \frac{V_{th}}{d_{gate}} \cdot \frac{\partial d_{gate}}{\partial V_{th}} = \frac{\alpha \cdot V_{th}}{V_{DD} - V_{th}} \qquad (3)$$

Furthermore, transistors with small gate sizes increase the
effect of geometry dependent parameter variations.
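To make the sensitivity expression above concrete, the following minimal sketch evaluates S as a function of the ratio VDD/Vth. The function name `delay_sensitivity` and the value α = 1.5 are assumptions chosen for illustration; the text only states that α lies between one and two.

```python
def delay_sensitivity(vdd_over_vth: float, alpha: float) -> float:
    """Relative delay sensitivity S = alpha * Vth / (VDD - Vth).

    Dividing numerator and denominator by Vth shows that S depends
    only on the ratio VDD/Vth: S = alpha / (VDD/Vth - 1).
    """
    if vdd_over_vth <= 1.0:
        raise ValueError("the first-order model requires VDD > Vth")
    return alpha / (vdd_over_vth - 1.0)


if __name__ == "__main__":
    alpha = 1.5  # assumed illustrative value between 1 and 2
    for ratio in (4.0, 2.9, 2.1):
        s = delay_sensitivity(ratio, alpha)
        print(f"VDD/Vth = {ratio:.1f}: S = {s:.2f}")
```

Under these assumptions the sensitivity grows from 0.5 at VDD = 4 × Vth to roughly 1.4 at VDD = 2.1 × Vth, which illustrates why local Vth variations matter more at low supply voltages.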

For conventional supply voltages, i.e., VDD > 3-4 × Vth, only
inter-die parameter variations are to be considered in digital
designs. These variations act globally on the entire chip or on
functional blocks, so that each device on one chip or in one
block shows the same deviation. Inter-chip or inter-block
variations can be caused by systematic effects like process
gradients over the wafer [2] with typical distances in the range
of functional block sizes or above. Variations of the gate oxide
thickness can, e.g., be regarded as global variations. Sets of
worst-case and best-case parameters are used during design
verification to cover the influence of these global variations.
Effects where each device on the chip is affected differently
are called intra-die or local variations. They are superimposed
on the global variations and are caused by, e.g., proximity
effects or statistical variations of doping concentrations. For
current technologies using standard device sizes, they are
small. Therefore, for the design of digital circuits operating
at conventional supply voltages they are regarded as negligible.

The impact of global parameter variation on the performance and
power dissipation is investigated in, e.g., [3] and [2], [4],
respectively. Since small dimension devices operating at low
supply voltages show an increased sensitivity to parameter
variations, the impact of local variations may become
significant. This has to be investigated for issues like skew
sensitive signals, testing methods, design for yield, and
performance estimation. Therefore, for the design of low voltage
digital circuits the effect of intra-die local parameter
variations has to be quantified.

In this paper, the influence of local Vth variations on path
delays for different supply voltages, path lengths, transistor
sizes, and technologies is investigated. Measurements are
performed on a carry select adder test circuit realized in a
0.5 and a 0.35 μm CMOS technology to verify this influence.
To demonstrate the impact on the design of low voltage
digital circuits, the influence on yield and on performance
estimation of pipelined systems is studied. To account for
future CMOS technologies the influence of local variations
on the circuit performance is calculated using technology
parameters according to the SIA (Semiconductor Industry
Association) road map [5].

The paper is organized as follows. In Section II, data of
intra-die Vth variations are presented and their impact on gate
delays and path delays is given. The test circuit and the
measurement results of path delay variations are shown in
Section III. In Section IV, the increase of gate delay
variations in future technologies is discussed. The impact on
the design is shown for yield evaluation and performance
estimation of pipelined architectures and SRAM's in Section V.
The results are summarized in the conclusion. The detailed
derivations of the gate and path delay variances are given in
the Appendix.

II. Vth Variations and Their
Impact on Gate and Path Delays

According to (2), the main parameter leading to an increase
of the sensitivity S is the threshold voltage Vth. Therefore,
investigations of Vth variations of small digital transistors are
of high importance in the design of low voltage digital circuits
[6], [7]. Previous measurements in [6] show uncorrelated
intra-die Vth variations with a $1/\sqrt{W L_{eff}}$ dependence
(Fig. 1). Even transistors with quite different W/Leff ratios
like 0.7/4.4, 1.8/1.7, and 7.5/0.4, but with the same gate area,
$1/\sqrt{W L_{eff}} \approx 0.57\ \mu m^{-1}$, show similar
standard deviations. Therefore, the variations are dominated by
an area dependent parameter. Variations of the oxide thickness
tox can be excluded because the variations of Vth result in
parallel shifts of the input characteristics. Moreover, to
detect proximity effects, different surroundings in polysilicon
coverage and field oxide areas were investigated, but their
influence is shown to be negligible for a 0.5 μm technology.
This means that the Vth variations are dominated by the
statistical deviations of the doping concentration in the
channel area due to the implantation process [6]. They can be
approximated with

«2 



(4) 



where QB is the depletion charge density, Wd is the depletion
layer width, N is the substrate doping, DI is the channel
doping implant dose, q is the elementary charge, W is the
transistor channel width, and Leff is the effective channel
length.
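The claim that the three quoted geometries share nearly the same gate area can be checked with a few lines of arithmetic. This is only a sketch: the helper name `inverse_sqrt_area` is hypothetical, and the W/Leff pairs are taken from the text and assumed to be in micrometers.

```python
from math import sqrt


def inverse_sqrt_area(width_um: float, length_um: float) -> float:
    """Return 1/sqrt(W * Leff) in 1/um for a given gate geometry."""
    return 1.0 / sqrt(width_um * length_um)


# (W, Leff) pairs quoted in the text, assumed to be in micrometers
GEOMETRIES_UM = [(0.7, 4.4), (1.8, 1.7), (7.5, 0.4)]

if __name__ == "__main__":
    for w, l in GEOMETRIES_UM:
        print(f"W/Leff = {w}/{l}: 1/sqrt(W*Leff) = {inverse_sqrt_area(w, l):.3f} 1/um")
```

All three values lie within about 0.01 μm⁻¹ of 0.57 μm⁻¹ even though the aspect ratios differ by more than two orders of magnitude, which is the basis for attributing the variations to an area dependent effect.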

The influence of $\sigma_{V_{th}}$ on the delay time of single
inverters with different transistor dimensions and supply
voltages was investigated by means of SPICE simulations using
$V_{th} \pm 3\sigma_{V_{th}}$ values. The variation of the delay,
normalized to the nominal delay, is shown in Fig. 2,
demonstrating the increase of delay variations at low supply
voltages and smaller dimensions. The delay of an inverter
realized with small transistors shows relative delay variations
due to local effects with $\sigma'_{gate,loc} =
\sigma_{gate,loc}/\bar{d}_{gate} = 5\%$ or $\sigma'_{gate,loc} =
3\%$ for VDD = 1.1 V (≈ 2.1 × Vth) and VDD = 1.5 V
(≈ 2.9 × Vth), respectively.

Fig. 1. Standard deviation of Vth versus $(W L_{eff})^{-1/2}$.
The error bars are obtained from experimental results. The
dotted lines represent least square fits to demonstrate the
linear $(W L_{eff})^{-1/2}$ dependence.

Fig. 2. Average delay variations of the low-high transitions
and the high-low transitions (Wp = 2Wn) of single inverters with
three different widths normalized to the nominal delay obtained
from SPICE simulations using measured $\sigma_{V_{th}}$ values.

The distance dependent correlation of the gate delays on
the chip is given by the autocorrelation coefficient
$\rho_a(\delta)$ [8] with



(5 



where $d_{gate}(x)$ is the delay of the xth gate in a path
comprising n gates. As the variations are caused by statistical
deviations of the doping concentrations, no area or distance δ
dependent correlation of the gate delays $d_{gate}$ exists. The
autocorrelation coefficient can be set to $\rho_a(\delta)
\approx 0$. This has to be compared with $\rho_a(\delta) \approx
1$ when only global effects are taken into account. With the
variance $\sigma_{gate}^2$ of a single gate, the variance of a
path comprising n gates is given as [see Appendix, (17)]

$$\sigma_{path}^{2} = n \cdot \sigma_{gate}^{2} \qquad (6)$$

The relative delay variation of a path is reduced for longer
paths when, e.g., using identical gates according to

$$\sigma'_{path} = \frac{\sigma_{path}}{\bar{d}_{path}} = \frac{\sigma'_{gate}}{\sqrt{n}} \qquad (7)$$

Fig. 3. Test structure of 24 bit carry select adder
(2-2-4-4-6-6 block structure) for path delay measurements.
To summarize the results of this section: First, local Vth
variations are clearly dominated by the doping variations. For
functional blocks with a size in the order of the size of our
test structure (640 × 640 μm²), global variations such as
variations of oxide thickness, gate length, and others are of
minor importance. Second, due to the local Vth variations an
increase of the relative path delay variations is expected for
shorter paths and for gates using small devices.
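The expected dependence on path length can be illustrated with a small Monte Carlo experiment. The sketch below assumes identical gates with unit nominal delay and independent, normally distributed gate delay deviations; the function name `relative_path_sigma`, the trial count, and the seed are illustrative choices and not taken from the measurements.

```python
import random
from statistics import mean, pstdev


def relative_path_sigma(n_gates: int, sigma_gate_rel: float,
                        trials: int = 20000, seed: int = 1) -> float:
    """Estimate sigma_path / mean(d_path) for n uncorrelated identical gates."""
    rng = random.Random(seed)
    path_delays = []
    for _ in range(trials):
        # each gate has nominal delay 1.0 and its own independent deviation
        path_delays.append(
            sum(rng.gauss(1.0, sigma_gate_rel) for _ in range(n_gates)))
    return pstdev(path_delays) / mean(path_delays)


if __name__ == "__main__":
    for n in (4, 8):
        est = relative_path_sigma(n, 0.05)
        print(f"n = {n}: simulated {est:.4f}, expected {0.05 / n ** 0.5:.4f}")
```

For a relative gate delay variation of 5%, the simulated relative path variation should come out close to 5%/√n, i.e., about 2.5% for four gates and about 1.8% for eight gates, so the shorter path shows the larger relative spread.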

III. Test Circuit and Experimental Results

To verify the influence of the local Vth variations on logic
circuits, a test circuit was designed, comprising a 24 bit carry
select adder typically used in data path structures. The circuit
is shown in Fig. 3 and is operated as follows: by switching the
signal select, which controls the two multiplexers MUX1 and
MUX2, new 24 bit patterns are applied to the adder inputs. The
strobe signal allows the storage of the 25 bit result in LATCH.
The skew between strobe and the select signal facilitates the
precise determination of individual path delays.

The test circuit was fabricated with two different
technologies. Firstly, with a 0.5 μm process with two threshold
voltages: for wafers of type I, Vth = 1/2 (Vthn + |Vthp|) equals
520 mV and for wafers of type II, Vth equals 290 mV.
Secondly, wafers were processed with a 0.35 μm technology
(type III) where Vth equals 400 mV. On each chip, the adder
is realized using different transistor dimensions: for wafers of
type I and II one with W/L = 0.7 μm/0.5 μm and one with
W/L = 1.4 μm/0.5 μm for the smallest transistors used. For
wafers of type III the ratios equal W/L = 0.5 μm/0.35 μm
and W/L = 1.0 μm/0.35 μm. For the p-channel devices a
channel width of Wp = 2 × Wn is taken.

The design allows the concurrent measurement of identical,
non-overlapping, adjacent paths on the same chip. For the
measurement of intra-die path delay deviations two different
path lengths are chosen (Fig. 4): four paths having a logic
depth of eight gates and 12 paths comprising four gates.
Significant statistical data are obtained by the compilation of
all path delay deviations normalized to the local mean value
$\bar{d}_{path}$ measured on several chips and wafers. Long
distance effects are excluded.

Fig. 4. Concurrent measurement of identical, non-overlapping
paths is possible, e.g., four paths with logic depth 8 or 12
paths with logic depth 4.

Fig. 5. Path delay variations normalized to the nominal delay
for two different path lengths and transistor dimensions,
showing the increased variations for lower supply voltages,
smaller dimensions, and reduced logic depth. The circuits in (a)
(wafers of type I) are processed for Vth = 520 mV whereas in (b)
for Vth = 290 mV. Representative for one curve, the error bars
are shown.

The results are shown in Fig. 5(a) and (b) for wafers of
type I and II, respectively. For both wafer types, an increase
of the relative path delay variations for smaller transistor
dimensions and reduced supply voltages is found. For the
same VDD, the low Vth circuits (type II) show a reduced delay
variation compared to type I. This is due to the higher value
of (VDD - Vth). Regarding the same VDD/Vth ratios of both
wafer types, nearly the same normalized delay variations are
obtained (see Fig. 6). This can be explained by (3), where
the sensitivity only depends on the ratio VDD/Vth.
The variations for both wafers reach a value of up to 9% for
$(d_{max} - d_{min})/d_{nom}$ for a path with 4 gates operating
at a supply voltage VDD = 2.1 × Vth (Fig. 6).

The delay variations versus VDD/Vth for wafers of type
III are shown in Fig. 7. The same trend as with a 0.5 μm
technology can be observed, but a higher value of the relative
variations at the same VDD/Vth ratio has to be regarded.
This is due to increased Vth variations for smaller devices
and higher doping concentrations and will be explained in the
next section.

Fig. 6. The relative path delay variations versus VDD/Vth show
similar values for structures realized with different threshold
voltages.

Fig. 7. The relative path delay variations versus VDD/Vth for
a 0.35 μm technology. The variations are increased compared to
a 0.5 μm technology due to increased Vth variations.

TABLE I

Device Technology Parameters, Where tox Is the
Gate Oxide Thickness and s Is the Scaling Factor

Parameter      Technology [μm]
               0.5     0.35    0.25    0.18
Leff [μm]      0.4     0.28    0.20    0.14
tox [nm]       9.5     8.0     6.0     4.5
s              1       0.7     0.5     0.36

The experimental results verify the assumptions of
Section II. The relative path delay variations increase for
smaller devices and reduced supply voltages and depend on
the logic depth according to (7), confirming the domination of
the local Vth variations.

IV. Impact of Scaling on Delay Variations 

The observation that the delays of adjacent paths can deviate
significantly has to be regarded in the design of digital low
voltage circuits for current and, moreover, for future CMOS
technologies where the variations tend to increase. To account
for future CMOS technologies the influence of local variations
on the circuit performance is calculated using technology
parameters according to the SIA road map (Table I).

The Vth variation with $\sigma_{V_{th}} \propto t_{ox} \cdot
N^{1/4} \cdot (1/(W L_{eff}))^{1/2}$ according to [6], [7] is
calculated with the technology parameters given in Table I and
scaling the impurity concentration N and the channel width W
with 1/s [9] and s, respectively. For gates comprising small
transistors and operated at VDD = 2.1 × Vth the trend of local
delay variations is shown in Fig. 8 by the solid line. For a
0.18 μm technology, $\sigma'_{gate,loc}$ increases solely due to
the physical effect of doping variations by a factor of two
(from 5% to 10%).

The dots show experimental results. The gap between
predicted variations and measurements for the 0.35 μm
technology can be explained by additional effects due to process
instabilities. Especially for the more advanced technologies,
uncorrelated variations due to instabilities like, e.g.,
transistor width deviations because of proximity effects [6]
have to be taken into account.

The estimated contribution of this effect is shown symbolically
by the shaded area. For our process, its estimated trend is
given by the dashed line. The resulting delay variation amounts
to $\sigma'_{gate,loc} = 15\%$ for a 0.18 μm technology.

The authors would like to point out that the solid line
presents the lower limit of variations which exist due to the
unavoidable doping variations.

Fig. 8. Expected increase of the local delay variations
$\sigma'_{gate,loc}$ (dashed line) for future technologies
partitioned into effects according to doping variations (solid
line) and process instabilities. A slight decrease for the
doping variations of a 0.35 μm technology is observed. This is
due to the usage of a dual work function technology with the
p-channel devices as surface channel devices. This results in
reduced Vth variations. The dots show experimental results.
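The scaling rule stated in this section can be turned into a short calculation. The sketch below only evaluates the proportionality of the Vth standard deviation to tox · N^(1/4) / sqrt(W · Leff), with N scaled by 1/s and W by s, relative to the 0.5 μm node. The dictionary `TABLE_I` and the function name are illustrative; the oxide thickness entries are assumed to be in nanometers (only their ratios enter), and effects such as the dual work function technology mentioned for the 0.35 μm node are not modeled.

```python
# node -> (Leff in um, tox in nm (assumed unit), scaling factor s), after Table I
TABLE_I = {
    "0.5":  (0.40, 9.5, 1.00),
    "0.35": (0.28, 8.0, 0.70),
    "0.25": (0.20, 6.0, 0.50),
    "0.18": (0.14, 4.5, 0.36),
}


def sigma_vth_ratio(node: str, reference: str = "0.5") -> float:
    """sigma_Vth of `node` relative to `reference`, from the proportionality only."""
    l_eff, t_ox, s = TABLE_I[node]
    l_ref, t_ref, s_ref = TABLE_I[reference]
    doping = (s_ref / s) ** 0.25                      # N scales with 1/s
    area = ((s_ref * l_ref) / (s * l_eff)) ** 0.5     # W scales with s
    return (t_ox / t_ref) * doping * area


if __name__ == "__main__":
    for node in TABLE_I:
        print(f"{node} um: sigma_Vth ratio = {sigma_vth_ratio(node):.2f}")
```

Under these assumptions the Vth standard deviation for the 0.18 μm node comes out about 1.7 times that of the 0.5 μm node. The relative delay variation at a fixed VDD/Vth ratio additionally depends on how Vth itself scales, which this sketch leaves out.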

V. Impact of Local Delay Variations
on Performance and Yield

Local delay variations have to be considered in the design
of low voltage circuits, where the impact gets significant
compared to the effect of standard global variations. According
to the results of the previous sections, circuits having short
critical paths will suffer most from the influence of local
delay variations. Short critical paths exist, e.g., in highly
pipelined circuits.

Taking into account the global parameter variations only,
the gate delay values of the critical path follow a normal
probability distribution function (pdf)
$N_{gate,glo}(\bar{d}_{gate}, \sigma_{gate,glo})$ where usually
a $3\sigma'_{gate,glo}$ value of 50% is used. With the
distribution function $F_{path,glo}$ of the critical path, the
designer can determine, e.g., the maximum applicable clock
frequency $1/t_{clk}$ to achieve a specified yield.

In the case of low voltage circuits, an additional safety
margin to account for the local effects has to be introduced.
Uncorrelated gate delay variations result in a broadened
distribution $N_{gate,glo+loc}(\bar{d}_{gate},
\sigma_{gate,glo+loc})$, with $\sigma_{gate,glo+loc} =
\sqrt{\sigma_{gate,glo}^2 + \sigma_{gate,loc}^2}$. With
$N_{gate,glo+loc}$ the path delay variation of a path
comprising n gates can be calculated according to (26) in the
Appendix with

$$\sigma'_{path,glo+loc} = \sqrt{(\sigma'_{gate,glo})^{2} + \left[\sigma'_{gate,loc}/\sqrt{n}\right]^{2}}$$

(see Fig. 9).

Fig. 9. (a) Additional uncorrelated gate delay variations lead
to a broadened distribution and thus to a reduction of the
yield. (b) Due to local effects the yield depends also on the
number of critical paths, which results in a shift of the
distribution function.

In the case of one critical path the corresponding distribution
function $F_{path,glo+loc}$ governs the yield of the circuit.

If the local effects are small ($\sigma'_{gate,loc} \approx
0\%$), this distribution represents also the yield for a
circuit having c critical paths. In that case, the yield is
independent of c. With $\sigma'_{gate,loc} > 0\%$, the c
critical paths show uncorrelated delay variations and the yield
is reduced [Fig. 9(b)] compared to the single-path case.
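Instead of the pdf for a single path

$$f(d_{path}) = \frac{1}{\sqrt{2\pi}\,\sigma_{path,glo+loc}}\; e^{-0.5\left[(d_{path} - \bar{d}_{path})/\sigma_{path,glo+loc}\right]^{2}} \qquad (8)$$

the pdf of a set of c critical paths has to be used, given as
the joint pdf of the path delays with

$$f(\mathbf{d}_{path}) = \frac{1}{\left[(2\pi)^{c}\,|C|\right]^{1/2}}\; e^{-0.5\,(\mathbf{d}_{path} - \bar{\mathbf{d}}_{path})^{T} C^{-1} (\mathbf{d}_{path} - \bar{\mathbf{d}}_{path})} \qquad (9)$$

where $C_{(c \times c)}$ is the covariance matrix. To
approximate the corresponding distribution function
$F_{c,glo+loc}(t_{clk}) = \int \!\cdots\! \int^{t_{clk}}
f(\mathbf{d}_{path})\, d\mathbf{d}_{path}$, Monte Carlo
simulations are used.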

Examples of circuits with a high number of critical paths
are data path structures, especially multiplier units. Depending
on the architecture, for a 32 bit multiplier based on carry save
adders the number c of critical paths can reach values of up to
32 × 32 = 1024. For that case, a logic depth of n = 8 gates is
obtained for each critical path. This represents the situation
for one full adder cell within each pipeline stage, comprising
four gates for the full adder block, two gates for the AND-gate,
and two gates for each latch.

For $\sigma'_{gate,loc} = 5\%$ and using unit delays, the
distribution functions of the path delays normalized to the
nominal delay $\bar{d}_{path,crit}$ are shown in Fig. 10 for
values of n = 8 and c = 1024. For comparison the functions for
c = 1, 34, and 100 are included and the parametric yield
regarding only global effects is also given. For a scenario of
1024 critical paths with the logic depth of n = 8, the clock
period has to be set to 1.27 × $\bar{d}_{path,crit}$ to reach a
parametric yield of 90% instead of 1.21 × $\bar{d}_{path,crit}$
regarding only the global effects. For values of
$\sigma'_{gate,loc} = 15\%$ (0.18 μm technology at VDD =
2.1 × Vth in Fig. 11) the clock period increases significantly
to 1.39 × $\bar{d}_{path,crit}$. Even for $\sigma'_{gate,loc} =
3\%$ (0.5 μm at VDD = 2.9 × Vth) the clock period amounts to
1.25 × $\bar{d}_{path,crit}$ (Fig. 11). The values for a 90%
parametric yield are shown in Table II.

Fig. 10. Parametric yield normalized to the nominal path delay
for sets of c = 1, 34, 100, and 1024 critical paths comprising
a logic depth of n = 132, 68, 36, and 8, respectively, regarding
global and local effects ($\sigma'_{gate,loc} = 5\%$) and
regarding only the global effects. The inset shows the entire
distribution function of $N_{path,glo}$.

Fig. 11. Small section of the distribution functions for the
path delays normalized to the nominal delays, shown for an
example with c = 1024 paths for values of $\sigma'_{gate,loc}$
between 0 and 15%.
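The clock period margins quoted above can be approximated with a simplified model. The sketch below is not the Monte Carlo procedure of the paper: it assumes that every path delay equals the nominal delay times (1 + g + l_i), where g is a global deviation shared by all paths and l_i is an independent local deviation per path with standard deviation σ'gate,loc/√n, and it evaluates the resulting yield by numerical integration over g. All function and parameter names are illustrative.

```python
from statistics import NormalDist

STD_NORMAL = NormalDist()  # zero mean, unit standard deviation


def parametric_yield(t_rel, c, n, sigma_glo, sigma_loc, steps=2000):
    """Probability that all c paths finish within t_rel nominal path delays."""
    sigma_path_loc = sigma_loc / n ** 0.5  # local part averages out over n gates
    g_min, g_max = -8.0 * sigma_glo, 8.0 * sigma_glo
    h = (g_max - g_min) / steps
    total = 0.0
    for k in range(steps + 1):
        g = g_min + k * h                         # shared global deviation
        weight = 0.5 if k in (0, steps) else 1.0  # trapezoidal rule
        density = STD_NORMAL.pdf(g / sigma_glo) / sigma_glo
        slack = t_rel - 1.0 - g
        if sigma_path_loc > 0.0:
            p_single = STD_NORMAL.cdf(slack / sigma_path_loc)
        else:
            p_single = 1.0 if slack >= 0.0 else 0.0
        total += weight * density * p_single ** c  # c independent local parts
    return total * h


def clock_ratio(target_yield, c, n, sigma_glo, sigma_loc):
    """Smallest t_clk / nominal path delay reaching target_yield (bisection)."""
    lo, hi = 1.0, 3.0
    for _ in range(40):
        mid = 0.5 * (lo + hi)
        if parametric_yield(mid, c, n, sigma_glo, sigma_loc) < target_yield:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    sigma_glo = 0.5 / 3.0  # 3 sigma of the global variation equals 50%
    for sigma_loc in (0.0, 0.03, 0.05, 0.15):
        r = clock_ratio(0.90, 1024, 8, sigma_glo, sigma_loc)
        print(f"sigma'_gate,loc = {sigma_loc:.0%}: t_clk / d_path = {r:.2f}")
```

Under these assumptions the global-only case gives a ratio of about 1.21, and the case of 1024 paths with eight gates each gives values close to the 1.27 and 1.39 quoted for 5% and 15% local variation. Small differences from the published numbers are to be expected because the model is a simplification.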

These results demonstrate that the impact of local delay
variation on yield evaluations depends strongly on the path
spectrum, i.e., the number of critical paths and their logic
depth.

Fig. 12. Standard six transistor SRAM cell. For reading, the
precharged bitlines BL or BLQ have to be discharged via MN1 or
MN2. For MN1 and MN2 a width of two times the minimal gate
length is used.

Fig. 13. Delay variations $\sigma'_{cell,loc}$ of the SRAM cell
delay $d_{cell}$ due to local parameter variations. Results for
a 0.5 μm (simulated) and for a 0.18 μm (estimated) technology
are given.

A further example of functional blocks with a regular
structure are embedded SRAM's. Bit failures due to Vth
variations can occur for four transistor cell structures and are
investigated in [10]. For low power applications, six transistor
cell [11] structures are used (Fig. 12) which are immune to
soft errors and where the static power dissipation is low.
Another characteristic of an SRAM is the access time for
reading data. This is determined by the critical path comprising
the address decoder, the cell, and the sense amplifier. Due to
the small device dimensions, the part most sensitive to local
parameter variations is the cell delay $d_{cell}$. The delay
$d_{cell}$ denotes the time for discharging the precharged
bitlines BL or BLQ via transistors MN1 and MN5 or MN2 and MN6,
respectively (Fig. 12). SPICE simulations show that $d_{cell}$
and its variations are clearly dominated by the properties
of the transfer transistors MN1 and MN2. The influence of
local parameter variations on the distribution function of
$d_{cell}$ is investigated for $3\sigma'_{gate,glo} = 50\%$,
VDD = 2.1 Vthn, and $W_{MN1}$ equal to two times the minimum
gate length L. Using the experimentally determined Vth
variations of the NMOS transistor with the specific gate area,
the delay variations $\sigma'_{cell,loc}$ are determined by
means of SPICE simulations (see Fig. 13). The cell delay needed
for a yield of 90% is calculated for different sizes of SRAM's,
where the number of critical paths equals two times the size of
the memory. For a 0.5 μm technology with $\sigma'_{cell,loc} =
2.7\%$, the ratio of this delay to $\bar{d}_{cell}$ for sizes
of 8K, 32K, and 1M is shown in Fig. 14. In the case of a 1M
SRAM, it amounts to 1.35 × $\bar{d}_{cell}$ compared to
1.21 × $\bar{d}_{cell}$ when local effects are neglected. For
future technologies the effects increase significantly. For a
0.18 μm technology, it is calculated to be 1.60 ×
$\bar{d}_{cell}$ to achieve the 90% yield for a 1M SRAM
(Fig. 15). The exact values are given in Table III.

Fig. 14. Distribution functions for the cell delay normalized
to $\bar{d}_{cell}$ of SRAM's with 8K, 32K, and 1M are shown
with local effects ($\sigma'_{cell,loc} = 2.7\%$, corresponding
to a 0.5 μm technology) and with regarding global effects only.
The plotted range of the normalized delay axis is 1.20 to 1.40,
with the 90% yield level marked.

Fig. 15. Distribution functions for the cell delay normalized
to $\bar{d}_{cell}$ of SRAM's with 8K, 32K, and 1M are shown
with local effects ($\sigma'_{cell,loc} = 8\%$, corresponding
to a 0.18 μm technology) and with regarding global effects only.

As the results shown above demonstrate an increased influence
of local delay variations with decreasing logic depth and
increasing number c of critical paths, local effects have to be
taken into account in designing pipelined circuits.

Neglecting register delay, with doubling the number of
pipeline stages the critical path length is halved and the
corresponding number c of critical paths is doubled. Both
lead to an increased effect of local delay variations on the
performance.

As an example, the performance gain is determined for
the 32-bit multiplier assuming 1, 2, 4, 8, and 16 pipeline
stages with the critical paths comprising 68, 36, 20, 12, and



366 



IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS. VOL 5. NO. 4. DECEMBER 1997 



TABLE II

Ratio of Clock Period to Critical Path Delay (and the Increased
Clock Period in Percentage Compared to Regarding Only the Global
Effects) to Achieve a Parametric Yield of 90%

$t_{clk}/\bar{d}_{path,crit}$ for c critical paths

$\sigma'_{gate,loc}$          c = 34, n = 68   c = 100, n = 36   c = 1024, n = 8
0%                            1.21 (0%)        1.21 (0%)         1.21 (0%)
3%  (0.5 μm @ 2.9 Vth)        1.21 (0%)        1.22 (0.8%)       1.25 (3.3%)
5%  (0.5 μm @ 2.1 Vth)        1.23 (1.7%)      1.24 (2.5%)       1.27 (5.0%)
15% (0.18 μm @ 2.1 Vth)       1.25 (3.3%)      1.27 (5.0%)       1.39 (14.9%)

[Fig. 16 plot. Vertical axis: 92% to 100%. Curves: 0.5 μm @
2.9 Vth, 0.5 μm @ 2.1 Vth, and 0.18 μm @ 2.1 Vth. Horizontal
axis:]

c paths:           34    100   232   496   1024
n gates/path:      68    36    20    12    8
pipeline stages:   1     2     4     8     16

Fig. 16. Percentage of performance gain achieved for pipelining
regarding local delay variations, $G_{glo+loc}$, compared to
regarding only global effects, $G_{glo}$. For values of
$\sigma'_{gate,loc} = 3, 5, 15\%$ the example of pipelining 34
paths with n = 68 gates into c = 1024 paths comprising eight
gates is shown.



TABLE III

Ratio of the Cell Delay to $\bar{d}_{cell}$ (and the Increase
in Percentage Compared to Regarding Only the Global
Effects) to Achieve a Parametric Yield of 90%

$\sigma'_{cell,loc}$          c = 2 × 8K    c = 2 × 32K   c = 2 × 1M
0%                            1.21 (0%)     1.21 (0%)     1.21 (0%)
2.7% (0.5 μm @ 2.1 Vth)       1.32 (9%)     1.33 (10%)    1.35 (12%)
8.0% (0.18 μm @ 2.1 Vth)      1.52 (26%)    1.55 (28%)    1.60 (32%)

8 gates. The performance gain $G_{glo+loc}$ with
$\sigma'_{gate,loc} = 5\%$ and 15%, normalized to the
performance gain $G_{glo}$ regarding only global effects, is
shown in Fig. 16. For $\sigma'_{gate,loc} = 5\%$ the
performance gain is reduced by 4%; for $\sigma'_{gate,loc} =
15\%$ the reduction amounts to 10%. Therefore, the effect of
local variations has to be checked when the optimum number
of pipeline stages for a high performance architecture is
determined.
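The reduction of the pipelining gain can be reproduced approximately from the two outer columns of Table II. The sketch below assumes that the clock period of a pipelined design is the logic depth n times the tabulated ratio of clock period to critical path delay, with register delay neglected as in the text. The dictionary and function names are illustrative, and the rounded table entries limit the precision of the result.

```python
# sigma'_gate,loc -> {(c paths, n gates): t_clk / nominal critical path delay}
TABLE_II = {
    "0%":  {(34, 68): 1.21, (1024, 8): 1.21},
    "5%":  {(34, 68): 1.23, (1024, 8): 1.27},
    "15%": {(34, 68): 1.25, (1024, 8): 1.39},
}


def speedup(sigma_loc: str, shallow=(34, 68), deep=(1024, 8)) -> float:
    """Clock period of the 1-stage design divided by that of the 16-stage design."""
    ratios = TABLE_II[sigma_loc]
    return (shallow[1] * ratios[shallow]) / (deep[1] * ratios[deep])


def normalized_gain(sigma_loc: str) -> float:
    """Gain with local variations relative to the global-only gain."""
    return speedup(sigma_loc) / speedup("0%")


if __name__ == "__main__":
    for key in ("5%", "15%"):
        loss = 1.0 - normalized_gain(key)
        print(f"sigma'_gate,loc = {key}: gain reduced by about {loss:.1%}")
```

With the rounded table values this gives a reduction of roughly 3% for 5% local variation and roughly 10% for 15%, which is in line with the values read from Fig. 16 within the rounding of the table entries.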

VI. Conclusion 

In this work we have presented the influence of local delay
variations due to Vth variations on path delays and on the
design of low voltage digital circuits.

Motivated by SPICE simulations and analytical results
which show relative gate delay variations due to local Vth
variations of up to $\sigma'_{gate,loc} = 5\%$ for inverters
with minimal feature size operating at low supply voltage, a
test circuit for the determination of path delay variations was
developed.

Measurements, taken for paths with different logic depths,
verified the increase of the relative path delay variations for
reduced logic depths, smaller dimensions, and reduced supply
voltages. Circuits fabricated with a reduced threshold voltage
show reduced delay variations when operating at the same VDD.

Due to the local variations, yield and performance of low
voltage circuits depend on the number and the logic depth of
the critical paths. Therefore, the influence on the yield and
the performance was investigated for pipelined architectures and
SRAM's. A scenario of 1024 critical paths with a logic depth
of n = 8 gates already shows significant effects for a current
0.5 μm technology. For example, for $\sigma'_{gate,loc} = 5\%$,
the ratio of clock period to the nominal critical path delay is
increased from 1.21 to 1.28 to achieve a parametric yield of
90% compared to designs operating at conventional supply
voltages, where the yield is determined by highly correlated
gate delay variations. For SRAM circuits, local parameter
variations have a significant impact on the cell delay. A
scenario for three different SRAM's was investigated, showing
that even for a current 0.5 μm technology in a 1M SRAM
the specification of the critical path delay has to be increased
by 12% of the cell delay to achieve a yield of 90%.

In scenarios for future technologies the influence of Vth
variations grows. For a 0.18 μm technology, the margin to
achieve the 90% yield will rise from 1.21 to 1.39.

Generally, for all signals which are critical to skew, like
clock drivers or input stages of latches or flip flops, no
minimal devices should be used. For the dynamic testing of
circuits, the test of a single critical path is no longer
sufficient to guarantee the function of the total circuit or to
classify the circuit to a certain performance window. The
tradeoff for a reduced Vth between higher performance and
increased power dissipation due to leakage is extended by the
aspect of reduced gate delay variations.

Further research will be done for low voltage memories,
which can comprise a high number of critical paths, and for
techniques with dynamic threshold voltage, to evaluate the
influence of the Vth variations.



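APPENDIX

A. Variances of Gate and Path Delays

The probability distribution function (pdf) of the gate delay
can be modeled by a normal distribution function
$N(m_1, \sigma^2)$ fully characterized by its mean value
$m_1 = \bar{d}_{gate}$ and its variance $\sigma^2 =
(\sigma_{gate})^2$ by

$$f(d_{gate}) = \frac{1}{\sqrt{2\pi}\,\sigma_{gate}}\; e^{-0.5\left[(d_{gate} - \bar{d}_{gate})/\sigma_{gate}\right]^{2}} \qquad (10)$$

The pdf of a path comprising n gates corresponds to the linear
combination of the n pdfs of the gate delays. The average path
delay is given by

$$\bar{d}_{path} = \sum_{i}^{n} \bar{d}_{gate_i} \qquad (11)$$

With the symmetrical covariance matrix C

$$C_{(n \times n)} = \begin{pmatrix} c_{11} & \cdots & c_{1n} \\ c_{21} & \cdots & c_{2n} \\ \vdots & & \vdots \\ c_{n1} & \cdots & c_{nn} \end{pmatrix}, \quad c_{ii} = \sigma_{gate_i}^{2}, \quad c_{ij} = \rho_{ij}\, \sigma_{gate_i}\, \sigma_{gate_j} \qquad (12)$$

and $\rho_{ij}$ the correlation between two gates, the variance
of the path can be expressed as

$$\sigma_{path}^{2} = \sum_{i}^{n} \sum_{j}^{n} c_{ij} \qquad (13)$$

$$\sigma_{path}^{2} = \sum_{i}^{n} \sum_{j}^{n} \rho_{ij}\, \sigma_{gate_i}\, \sigma_{gate_j} \qquad (14)$$

For $\rho_{ij} = \rho$ (i ≠ j), i.e., the intergate correlation
is the same for all gates, and with the relative gate delay
variation

$$\sigma'_{gate} = \sigma_{gate}/\bar{d}_{gate} \qquad (15)$$

equation (13) can be simplified to

$$\sigma_{path}^{2} = \sum_{i}^{n} (\sigma'_{gate_i}\, \bar{d}_{gate_i})^{2} + \sum_{i}^{n} \sum_{j \neq i}^{n} \sigma'_{gate_i}\, \bar{d}_{gate_i} \cdot \rho \cdot \sigma'_{gate_j}\, \bar{d}_{gate_j}$$

$$\sigma_{path}^{2} = (\sigma'_{gate})^{2} \left[ \rho\, (\bar{d}_{path})^{2} + (1 - \rho) \sum_{i}^{n} (\bar{d}_{gate_i})^{2} \right] \quad \text{for } \sigma'_{gate_i} = \sigma'_{gate} \qquad (16)$$

Of interest are the two cases ρ = 0 for local, i.e., uncorrelated
variations ($\sigma'_{gate} = \sigma'_{gate,loc}$) and ρ = 1 for
the occurrence of only global, i.e., correlated variations
($\sigma'_{gate} = \sigma'_{gate,glo}$).

For a path with n gates and for $\bar{d}_{gate_i} =
\bar{d}_{gate}$ the standard deviation of a path can be
simplified to

$$\rho = 0: \quad \sigma_{path,loc} = \sqrt{n} \cdot \sigma_{gate,loc} \qquad (17)$$

$$\sigma'_{path,loc} = \sigma'_{gate,loc}/\sqrt{n} \qquad (18)$$

$$\rho = 1: \quad \sigma_{path,glo} = n \cdot \sigma_{gate,glo} \qquad (19)$$

$$\sigma'_{path,glo} = \sigma'_{gate,glo} \qquad (20)$$

The first case clarifies that with ρ = 0 the relative path delay
variation is reduced for longer paths (for identical gates with
$1/\sqrt{n}$). For the latter case, the relative path delay
variations are constant, as they are usually considered in the
design with conventional supply voltages.

With both global and local variations, the pdf of the gate
delay is given by the linear combination of
$N_{glo}(\bar{d}_{gate}, \sigma_{gate,glo})$ and
$N_{loc}(0, \sigma_{gate,loc})$ to
$N_{glo+loc}(\bar{d}_{gate}, \sigma_{gate,glo+loc})$ with
$\sigma_{gate,glo+loc}$ according to (14) and (15)

$$\sigma_{gate,glo+loc}^{2} = \sigma_{gate,glo}^{2} + \sigma_{gate,loc}^{2} \qquad (21)$$

The calculation of the variance of the path delay regarding
local as well as global effects is demonstrated for an example
with two gates. For that case, the covariance matrix
$C_{(m \times m)}$ can be expressed with m = 2n as

$$C_{(m \times m)} = \begin{pmatrix} C_{11} & C_{12} \\ C_{21} & C_{22} \end{pmatrix} \qquad (22)$$

where the submatrices (simplified for gate 1 with
$\sigma_{1,glo}$ and $\sigma_{1,loc}$ and for gate 2 with
$\sigma_{2,glo}$ and $\sigma_{2,loc}$) are

$$C_{11} = \begin{pmatrix} \sigma_{1,glo}^{2} & \sigma_{1,glo}\sigma_{2,glo} \\ \sigma_{2,glo}\sigma_{1,glo} & \sigma_{2,glo}^{2} \end{pmatrix}, \quad C_{12} = C_{21} = \begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix}, \quad C_{22} = \begin{pmatrix} \sigma_{1,loc}^{2} & 0 \\ 0 & \sigma_{2,loc}^{2} \end{pmatrix}$$

and the correlation coefficients between the gates are

$$\rho_{glo\,glo} = 1, \quad \rho_{loc\,glo} = 0, \quad \rho_{glo\,loc} = 0, \quad \rho_{loc\,loc} = 0. \qquad (23)$$

The variance $(\sigma_{path,glo+loc})^{2}$ of the path can be
calculated according to (14) with

$$\sigma_{path,glo+loc}^{2} = \sigma_{1,glo}^{2} + 2\,\sigma_{1,glo}\sigma_{2,glo} + \sigma_{2,glo}^{2} + \sigma_{1,loc}^{2} + \sigma_{2,loc}^{2}, \quad n = 2. \qquad (24)$$

For $\sigma_{1,glo} = \sigma_{2,glo} = \sigma_{glo}$ and with
(17) and (19) this expression is simplified to

$$\sigma_{path,glo+loc}^{2} = \sigma_{path,glo}^{2} + \sigma_{path,loc}^{2} \qquad (25)$$

so that the relative path delay variation follows with (18) and
(20) as

$$(\sigma'_{path,glo+loc})^{2} = (\sigma'_{path,glo})^{2} + (\sigma'_{path,loc})^{2} = (\sigma'_{gate,glo})^{2} + (\sigma'_{gate,loc})^{2}/n \qquad (26)$$

For example, with increasing logic depth, the impact of the
local variations is reduced.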
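The closed form for a uniform intergate correlation can be checked numerically against the double sum over the covariance entries. The sketch below does this for an arbitrary set of mean gate delays; the function names and the example numbers are illustrative and not taken from the measurements.

```python
def path_variance_double_sum(mean_delays, sigma_rel, rho):
    """Sum rho_ij * sigma_i * sigma_j over all gate pairs (rho_ii = 1)."""
    total = 0.0
    for i, d_i in enumerate(mean_delays):
        for j, d_j in enumerate(mean_delays):
            corr = 1.0 if i == j else rho
            total += corr * (sigma_rel * d_i) * (sigma_rel * d_j)
    return total


def path_variance_closed_form(mean_delays, sigma_rel, rho):
    """sigma'^2 * [rho * d_path^2 + (1 - rho) * sum(d_i^2)]."""
    d_path = sum(mean_delays)
    sum_sq = sum(d * d for d in mean_delays)
    return sigma_rel ** 2 * (rho * d_path ** 2 + (1.0 - rho) * sum_sq)


if __name__ == "__main__":
    delays = [1.0, 1.5, 0.8, 1.2]  # assumed mean gate delays, arbitrary units
    for rho in (0.0, 0.3, 1.0):
        a = path_variance_double_sum(delays, 0.05, rho)
        b = path_variance_closed_form(delays, 0.05, rho)
        print(f"rho = {rho}: double sum {a:.6f}, closed form {b:.6f}")
```

For ρ = 0 and identical gates the closed form reduces to a relative path variation of σ'gate/√n, and for ρ = 1 it reduces to σ'gate, matching the two limiting cases discussed in this appendix.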